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Library components are composed of devices, symbols, and footprints which 
are the fundamental parts to design and develop printed circuit boards 
(PCB). In archiving PCB libraries, the creation and checking of the 
component’s parameters are very important. The checking of footprints and 
pin configurations from datasheets are manually done which causes delays 
and downtime when the components do not fit the developed PCB. It was 
found in a survey that 61.5% of errors come from footprints dimensions. 
These cause delays in the production and development of PCBs. In this 
study, a parameter extractor and verifier system were developed to mitigate 
the error of manual process of archiving library parameters from datasheets. 
Text recognition and pattern matching algorithms were used for the 
processing of images from datasheets. The 10-trial tests conducted for each 
footprint for the extraction of text from the cropped images for standard 
operating procedures (SOP) series and heat sink thin shrink small outline 


package (HTSSOP) series were found successful without any error. The 
checking of footprints and pin configuration was reduced from 45 minutes to 
25 minutes. The developed system was evaluated based on user perception 
by 10 PCB library users which resulted to agree that the system is functional, 
efficient, and convenient to use. 
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1. INTRODUCTION 

The evolution of the design and development of printed circuit boards (PCBs) emerges with the 
changes in integrated circuit (IC) packages and other electronic devices [1]. The footprint and pin 
configuration on PCB designs from pin-holed components change as surface-mount and lightweight 
components evolve. Manufacturers have developed library systems for easy configuration of devices from 
those available locally and online. To reuse components, library systems are developed [2]. This is also 
applied to electronic components archiving and development of PCBs [3], [4]. 

The electronic circuits’ PCB design undergoes three stages namely, libraries, schematics, and board. 
The fundamental part in PCB design is the library. In the library can be found devices, symbols, and 
footprints of electronic components such as ICs [5], [6]. The components’ footprints and symbols are 
designed with schematics. The footprints are the arrangement of pads that are physically and electrically 
connected which are composed of parameters and details that help the designer to identify easily which parts 
need to be modified. The board plays an important role where the IC footprints [7] must be fitted well. Even 
in the simulation for the development of micro-electro-mechanical (MEMS), component libraries must be 
present [8]. The footprint components’ information must be in place even when designs vary depending on 
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the manufacturer [9]. This information, including the manufacturer’s name, can be included as parameters in 
the datasheets for package classification [10]. 

Standardization of electronic component footprints and other features must be established in order to 
allow designers optimize their PCB designs and schematics [11]. However, the uniqueness of components 
being developed may vary because of diverse applications. Datasheets and drawings come in different 
formats from different electronic components manufacturers. In order for these products to be archived in a 
library, the user needs to input it manually by reading the data sheet and encoding all the parameter values of 
the foot print and pin configuration. Human error always exists in manual processes like this. 

A survey was conducted in one of the leading electronic components manufacturers in the 
Philippines which shows that 61.5% of the PCB designers and librarians agreed that oftentimes, the encoding 
of the dimensions of the footprints are the common sources of errors. Dimensions are critical in the design of 
PCBs because the copper track layout and spacing between pads rely on it [12]. The inspection of PCBs 
relies on the data archived in the library. It is time consuming and considered a downtime when the 
parameters did not match with the PCB designed and developed which can turn out into waste [13]-[15]. 
Visual component libraries play a vital role for optimized designs minimizing errors and cost in PCB 
development [16], [17]. 

This paper presents a method of automating the data parameters extraction and verification from the 
data sheet drawings for archiving it in a library of electronic components. An image processing, text 
recognition, and pattern matching algorithm were used to extract the parameters from an automatically 
cropped image from data sheet drawings. In this way, the error and time consumed in checking footprints and 
pin configuration will be lessened. The developed system is a web pack running in a local network. 


2. METHODOLOGY 
2.1. The IC PCB footprint and pin configuration parameter extractor and verifier system (PEVS) 

The process of the parameter extractor and verifying system is presented in Figure 1, which has five 
parts: the load image (LD) for image acquisition; extraction using the application protocol interface (API) 
commands; classification; optimization; printing and displaying. The extraction undergoes three 
subprocesses: the preprocessing for filtration and binarization of image, the segmentation for character 
recognition, and the feature extraction. The image post processing output will be fed to the library or can be a 
printable and readable display. 

Text recognition technology has been widely used since it was discovered. Optical character 
recognition (OCR) has been extensively developed using different technologies [18]—[23] even for much 
more advanced techniques [24]—[26]. In this system, tesseract, which is an open-source OCR engine, is used 
because of its excellent capability of recognizing six western languages. TesseractJS is a plugin integrated in 
the system which can run in a browser or server with NodeJs. 

The PEVS has three display windows: the main window, the middle, and the training window. In the 
main window, the user can view the result of the two images being compared, where the image post- 
processing output is displayed. In the middle window is where the training datasets or images can be viewed. 
In the training window is where the training is done by extracting the parameters from the image. It is where 
the pdf files are loaded, cropped, and converted into an image file. 


2.2. Training and verification setup 

It is imperative that the character recognition must be set up well. VUE.js, which has an open-source 
software is used for building user-machine interface. Within a single page, these interfaces can be viewed. By 
building this framework, the PDF file will be loaded to the system for parameters training. It will be cropped 
and converted into an image file format. The pattern matching algorithm is used to look for a reference in the 
drawing to identify points in the edges for cropping. The region of interest which are the blue rectangles will 
be set up in place for character recognition. 

Tesseract is used as a plugin for text recognition and image processing. It also has a classification 
system in which the images are compared with the pre-defined patterns for parameter detection. Once done, 
the trained setup will be saved. See Figure 2 for the pseudo code. The original and the process image are 
being displayed in a viewport for verification. The platform used as a front end is made using hypertext 
processor (PHP) along with vue.js for web development in a local network. The results will be shown in 
percent similarity. 


2.3. Time Study 


Automating the processes can also be evaluated through time studies, where the time spent for the 
automated process is being compared to the previous process. The time spent for processing one data sheet 
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for parameter extraction and verification for both footprint and pin configuration were recorded and 
compared. The results must verify that the downtime is reduced. 
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Figure 1. Process flow chart of PEVS 


Pseudo Code for Datasheet Parameter Extractor System 


1: Initialize package PN = package name and PD= package description 
2. Package = (Set Initial Package Value:); 

J Display Responses; 

4. "Package_Name:" 

Sz "Package Description:" 

6. { 

T Case 1: //For Package name 

8. IF Package name = 1 (true), proceed 

i (Package Name Exist); 

10 ELSE package name = @ (false), proceed 

ii. Set the Package name edit the value 

12. Case 2: //For Package description 

ia IF Package description = 1 (true), proceed 
14. (Package description); 

15. ELSE Package description = @ (false), proceed 
16. Set the Package description edit the value 
17. } 

18. Choose and Load PDF file = Data storage 

19. Select available PDF files, process 

20. Initialize PDF file 

Ji. { 

22. IF 

23. Load PDF file isn't recognized go to step 18 to Choose file 
24. print response, "The file you choose doesn't response" 
25. ELSE 

26. PDF training, proceed 

rr scrape, getting position to crop 

28. Print "Display image output" 

29. } 

30. Input Parameters one at a time 

31. Add region of rectangles at the Display Image 

32. Training of parameters done 

33. Save Training ( to Data stored) 

34. Select images from training data stored, decision 

35. Print response “Display responses of images" 

36. "Image 1" = "Created footprint" 

I = "Datasheet" 

38. Comparing of 2 images and its parameters 

39: Print response "Result Shown" 


40. All data save at Data stored 


Figure 2. Pseudo code of the developed PEVS 
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3. RESULTS AND DISCUSSION 
3.1. PVES development 

The developed system is deployed in a local network and has a package display window as 
presented in Figure 3. The package name and description of the specific IC or electronic component will be 
needed. For testing purposes, three types of library components were used, namely the heat sink thin shrink 
small outline package (HTSSOP) series, standard operating procedures (SOP) series, and VQFP48C series. 
These are integrated circuits which have different pin configurations and footprints. 

Once the package is set, it will be loaded and cropped for processing the image. The pattern 
matching algorithm will identify the position of the drawing and crop it accordingly with an identified size. 
When cropped, the rectangles are positioned as regions of interest for character recognition. It will be 
converted into an image file and saved in a database for archiving. The cropping and converting images are 
presented in Figure 4. The datasheet is being saved first, as presented in Figure 4(a). In Figure 4(b), the pdf 
file was loaded. It undergoes pattern matching for cropping into an image file which in result is now ready 
for text recognition as presented in Figure 4(c). 
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Figure 4. Cropping and converting of pdf file; (a) datasheet saving, (b) loading and cropping, 
and (c) converted pdf zinto an image file 


3.2. Parameter extraction 

The extraction of parameter values from the cropped image is done by putting rectangular boxes as 
regions of interest for text recognition. During the training stage, the parameters are written manually and the 
rectangular boxes are positioned manually as well. When successful, the system will automatically analyze 
the data sheet fed for analysis. 
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The parameter names of the components are put in the command box. All the parameters set by the 
operator will be added to the classification system as presented in Figure 5. Figure 5(a) presents an example 
of the extraction results. In this case, there are 10 parameters listed and saved for archiving. Each designated 
rectangle on the cropped image has designated parameters, such as inner gap, pad length, pad width, heat tab 
width, heat tab length, pitch, via pitch, hole size, and body length. For electronic packages with 5 parameters, 
other parameters can be simply added or deleted in the command box, as presented in Figure 5(b). 

As an example, the dimension parameters are user-defined circuit dimensions, which are critically 
identified because it allows the analysis engine to modify the circuit to perform parameter checks and 
optimization. It provides a faster way for the designer to change dimensions in the project editor, PCB 
Cadence, as shown in Figure 6. The parameters identified will also be used for verification. 
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Figure 5. Parameter extraction (a) parameter command window and (b) 5-parameter settings with assigned 
region of interest 
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Figure 6. Parameters in a footprint by PCB cadence 


3.3. Verification 

The verification system has the following functions: compare, set, and search functions. The 
preliminary test result for footprint parameters comparison between SOP8 products from different 
manufacturers matches identically as seen in Figure 7. The image comparison for the same product from 
different manufacturers is 71.43% simular. The verification of pin configuration for two datasheets of SOP8 
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from different manufacturers are presented in Figure 8. The two images are 71.88% similar. The texts were 
successfully recognized for the two images. 
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Figure 8. Main window with the package parameters and preliminary results for pin configuration 
comparison 


3.4. Clicks and time study analysis of manual vs. PEVS checking of footprint and pin configuration 
The number of clicks for checking footprints and pin configuration between manual and PEVS are 
recorded for comparison. The process of checking components by listing numbers of pins is recorded. The 
data are taken from the average of 10 users. The average time spent for checking IC footprints is 45 minutes 
as tabulated in Table 1. The manual checking of HTSSOP-B20 footprint, a 20-pin IC with 9 parameters is 30 
clicks. The designer spends 20.4 minutes checking it. Using the developed PEVS, an average of 22 clicks or 
10.75 minutes for the HTSSOP-B20. A total of 58 clicks were observed for checking the footprint of 
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HTSSOP-B20, SOP8, and VQFP48C using the PEVS. It is faster compared to that of the manual process 
which has a total of 72 clicks. The pin configuration checking using the manual method averages to 45 clicks 
in 45 minutes. Using the PEVS, only 35 clicks on average are made for 28 minutes. See Table 2 for the pin 
configuration checking results. A total of 120 clicks were observed using the PEVS in checking the pin 
configuration of the three components which is way faster than the manual checker with 175 clicks. 


Table 1. Datasheet footprint checking and verificatation 


Manual Checking PEVS 
Library No. of Clicks No. of Clicks 
Component 
HTSSOP-B20 30 22 
SOP8 20 16 
VQFP48C 48 20 
Total 72 58 


Table 2. Datasheet pin configuration checking and verificatation 


Manual Checking PEVS 
Library No. of Clicks No. of Clicks 
Component 
HTSSOP-B20 45 35 
SOP8 25 20 
VQFP48C 105 65 
Total 175 120 


Additional tests were conducted for a series of library components for footprint checking. The 
results of checking of SOP-series ICs are presented in Figure 9 as follows: the SSOP6 in Figure 9(a), MSOP8 
in Figure 9(b), and SOP8 in Figure 9(c). There were 10 trials conducted for each of these components. Same 
goes with the HTSSOP-series in Figure 10, where HTSSOP-B28 in Figure 10(a), HTSSOP-B20 in Figure 
10(b), and HTSSOP-J8 in Figure 10(c) were tested. The results were summarized in Table 3. The 3 
preliminary tests were conducted for training. The remaining tests determine the efficiency of the developed 
system. As a result, the SOP8 and HTSSOP-B20 fail in the first two tests. The HTSSOP-J8 also failed in the 
first test while HTSSOP-B28 fails until the third test. The succeeding test passed in which the system was 
verified to be working properly as expected. 
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Figure 10. HTSSOP-series footprint checking (a) HTSSOP-B28, (b) HTSSOP-B20, and (c) HTSSOP-J8 
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Table 3. Additional test for footprint checking for SOP and HTSSOP series 
Library Component 1 4 5 6 8 10 

SOP8 

MSOP8 

SSOP6 
HTSSOP-J8 
HTSSOP-B20 
HTSSOP-B28 
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4. CONCLUSION 

This study developed a PEVS which is able to improve the process of parameter extraction and 
verification for archiving electronic component footprint and pin configuration in a library. The average of 
the manual process which is 45 minutes for IC footprint checking was improved to 10.75 minutes, while the 
pin configuration checking and verification was improved from an average of 45 minutes to 28 minutes. 
Using the technique of image post-processing by pattern matching and text recognition, the parameters can 
now be generated from the pdf file types of datasheets of electronic components. Based on the conducted 
tests, the system was able to extract parameters from different components, the SOP8 series and the HTSSOP 
series. The inputting of parameters to the library system had mitigated errors due to the manual process. The 
PEVS was deployed in a local network so that the librarians can easily access it. The end-to-end integration 
to a library software will be done in the future to extend the application of the developed system and will be 
deployed online. 
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